Overview
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 95 |
| Missing cells (%) | 0.7% |
| Duplicate rows | 33 |
| Duplicate rows (%) | 3.3% |
| Total size in memory | 101.7 KiB |
| Average record size in memory | 104.1 B |
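The percentages and the average record size in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 1000
missing_cells = 95
duplicate_rows = 33
total_size_kib = 101.7

# Assumption: the missing-cell share is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The three results round to 0.7%, 3.3%, and 104.1 B, which matches the table.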
Variable types
| Text | 5 |
|---|---|
| Numeric | 4 |
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 33 (3.3%) duplicate rows | Duplicates |
| km_driven is highly correlated overall with year | High correlation |
| selling_price is highly correlated overall with transmission and 1 other field | High correlation |
| transmission is highly correlated overall with selling_price | High correlation |
| year is highly correlated overall with km_driven and 1 other field | High correlation |
| seller_type is highly imbalanced (52.7%) | Imbalance |
| mileage has 19 (1.9%) missing values | Missing |
| engine has 19 (1.9%) missing values | Missing |
| max_power has 19 (1.9%) missing values | Missing |
| torque has 19 (1.9%) missing values | Missing |
| seats has 19 (1.9%) missing values | Missing |
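The 52.7% imbalance figure for seller_type can be reproduced from the class counts reported later in this document (837, 135, 28). The sketch below assumes the score is one minus the normalized Shannon entropy of the class shares. That formula is an assumption on my part, not something the report states, but it does reproduce the reported value. The function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of
    the class shares and k is the number of classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    if len(counts) < 2:
        return 0.0
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1 - entropy / math.log2(len(counts))

# seller_type counts: Individual, Dealer, Trustmark Dealer
seller_type_counts = [837, 135, 28]
score = imbalance_score(seller_type_counts)
print(f"{score:.1%}")  # prints 52.7%
```

Under the same assumed formula, fuel (534, 457, 5, 4) and transmission (877, 123) score roughly 0.47 and 0.46, which would explain why only seller_type is flagged if the alert threshold is 0.5.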
Reproduction
| Analysis started | 2025-11-30 13:12:45.122969 |
|---|---|
| Analysis finished | 2025-11-30 13:12:46.001979 |
| Duration | 0.88 seconds |
| Software version | ydata-profiling v4.18.0 |
| Download configuration | config.json |
Variables
name
Text
| Distinct | 621 |
|---|---|
| Distinct (%) | 62.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 24.857 |
| Min length | 11 |
Unique
| Unique | 440 |
|---|---|
| Unique (%) | 44.0% |
Sample
| 1st row | Mahindra Xylo E4 BS IV |
|---|---|
| 2nd row | Tata Nexon 1.5 Revotorq XE |
| 3rd row | Honda Civic 1.8 S AT |
| 4th row | Honda City i DTEC VX |
| 5th row | Tata Indica Vista Aura 1.2 Safire BSIV |
| Value | Count | Frequency (%) |
| maruti | 290 | 6.2% |
| hyundai | 198 | 4.2% |
| tata | 106 | 2.3% |
| mahindra | 90 | 1.9% |
| swift | 83 | 1.8% |
| diesel | 83 | 1.8% |
| bsiv | 79 | 1.7% |
| vxi | 74 | 1.6% |
| 1.2 | 71 | 1.5% |
| plus | 64 | 1.4% |
| Other values (495) | 3549 | 75.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3687 | 14.8% |
| a | 1852 | 7.5% |
| i | 1631 | 6.6% |
| t | 1253 | 5.0% |
| r | 1094 | 4.4% |
| o | 1010 | 4.1% |
| n | 934 | 3.8% |
| e | 890 | 3.6% |
| u | 738 | 3.0% |
| S | 701 | 2.8% |
| Other values (57) | 11067 | 44.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 24857 |
year
Real number (ℝ)
High correlation
| Distinct | 24 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.681 |
| Minimum | 1995 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1995 |
|---|---|
| 5-th percentile | 2006 |
| Q1 | 2011 |
| median | 2014 |
| Q3 | 2017 |
| 95-th percentile | 2019 |
| Maximum | 2020 |
| Range | 25 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.0121486 |
|---|---|
| Coefficient of variation (CV) | 0.001992445 |
| Kurtosis | 1.2158841 |
| Mean | 2013.681 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -1.0223557 |
| Sum | 2013681 |
| Variance | 16.097336 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
Common Values
| Value | Count | Frequency (%) |
| 2017 | 134 | 13.4% |
| 2016 | 106 | 10.6% |
| 2015 | 96 | 9.6% |
| 2018 | 91 | 9.1% |
| 2011 | 85 | 8.5% |
| 2012 | 83 | 8.3% |
| 2014 | 79 | 7.9% |
| 2013 | 76 | 7.6% |
| 2019 | 64 | 6.4% |
| 2010 | 49 | 4.9% |
| Other values (14) | 137 | 13.7% |
Minimum values
| Value | Count | Frequency (%) |
| 1995 | 1 | 0.1% |
| 1998 | 1 | 0.1% |
| 1999 | 5 | 0.5% |
| 2000 | 1 | 0.1% |
| 2001 | 2 | 0.2% |
| 2002 | 4 | 0.4% |
| 2003 | 8 | 0.8% |
| 2004 | 10 | 1.0% |
| 2005 | 10 | 1.0% |
| 2006 | 20 | 2.0% |
Maximum values
| Value | Count | Frequency (%) |
| 2020 | 4 | 0.4% |
| 2019 | 64 | 6.4% |
| 2018 | 91 | 9.1% |
| 2017 | 134 | 13.4% |
| 2016 | 106 | 10.6% |
| 2015 | 96 | 9.6% |
| 2014 | 79 | 7.9% |
| 2013 | 76 | 7.6% |
| 2012 | 83 | 8.3% |
| 2011 | 85 | 8.5% |
selling_price
Real number (ℝ)
High correlation
| Distinct | 274 |
|---|---|
| Distinct (%) | 27.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 617901.04 |
| Minimum | 31000 |
| Maximum | 6000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 31000 |
|---|---|
| 5-th percentile | 100000 |
| Q1 | 250000 |
| median | 434999 |
| Q3 | 670000 |
| 95-th percentile | 1904049 |
| Maximum | 6000000 |
| Range | 5969000 |
| Interquartile range (IQR) | 420000 |
Descriptive statistics
| Standard deviation | 758553.86 |
|---|---|
| Coefficient of variation (CV) | 1.22763 |
| Kurtosis | 21.438457 |
| Mean | 617901.04 |
| Median Absolute Deviation (MAD) | 205000 |
| Skewness | 4.2148309 |
| Sum | 6.1790104 × 10^8 |
| Variance | 5.7540396 × 10^11 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
| 300000 | 29 | 2.9% |
| 350000 | 28 | 2.8% |
| 600000 | 28 | 2.8% |
| 550000 | 25 | 2.5% |
| 650000 | 24 | 2.4% |
| 400000 | 24 | 2.4% |
| 250000 | 22 | 2.2% |
| 500000 | 22 | 2.2% |
| 750000 | 22 | 2.2% |
| 450000 | 16 | 1.6% |
| Other values (264) | 760 | 76.0% |
Minimum values
| Value | Count | Frequency (%) |
| 31000 | 1 | 0.1% |
| 33983 | 1 | 0.1% |
| 35000 | 1 | 0.1% |
| 40000 | 1 | 0.1% |
| 45000 | 5 | 0.5% |
| 46000 | 1 | 0.1% |
| 50000 | 2 | 0.2% |
| 52000 | 2 | 0.2% |
| 55000 | 3 | 0.3% |
| 55599 | 1 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 6000000 | 2 | 0.2% |
| 5500000 | 5 | 0.5% |
| 5400000 | 2 | 0.2% |
| 5150000 | 3 | 0.3% |
| 4100000 | 1 | 0.1% |
| 3800000 | 2 | 0.2% |
| 3750000 | 1 | 0.1% |
| 3400000 | 1 | 0.1% |
| 3251000 | 1 | 0.1% |
| 3200000 | 8 | 0.8% |
km_driven
Real number (ℝ)
High correlation
| Distinct | 260 |
|---|---|
| Distinct (%) | 26.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71393.341 |
| Minimum | 1303 |
| Maximum | 375000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1303 |
|---|---|
| 5-th percentile | 9190 |
| Q1 | 37000 |
| median | 61500 |
| Q3 | 100000 |
| 95-th percentile | 160000 |
| Maximum | 375000 |
| Range | 373697 |
| Interquartile range (IQR) | 63000 |
Descriptive statistics
| Standard deviation | 48486.219 |
|---|---|
| Coefficient of variation (CV) | 0.67914203 |
| Kurtosis | 3.8337561 |
| Mean | 71393.341 |
| Median Absolute Deviation (MAD) | 28500 |
| Skewness | 1.4228571 |
| Sum | 71393341 |
| Variance | 2.3509134 × 10^9 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
| 120000 | 66 | 6.6% |
| 70000 | 58 | 5.8% |
| 60000 | 55 | 5.5% |
| 80000 | 54 | 5.4% |
| 40000 | 46 | 4.6% |
| 50000 | 44 | 4.4% |
| 90000 | 38 | 3.8% |
| 110000 | 35 | 3.5% |
| 100000 | 33 | 3.3% |
| 30000 | 27 | 2.7% |
| Other values (250) | 544 | 54.4% |
Minimum values
| Value | Count | Frequency (%) |
| 1303 | 1 | 0.1% |
| 2000 | 7 | 0.7% |
| 2388 | 1 | 0.1% |
| 2600 | 1 | 0.1% |
| 3100 | 1 | 0.1% |
| 3500 | 2 | 0.2% |
| 3564 | 1 | 0.1% |
| 4000 | 1 | 0.1% |
| 4337 | 1 | 0.1% |
| 5000 | 9 | 0.9% |
Maximum values
| Value | Count | Frequency (%) |
| 375000 | 1 | 0.1% |
| 300000 | 2 | 0.2% |
| 298000 | 1 | 0.1% |
| 291000 | 1 | 0.1% |
| 270000 | 1 | 0.1% |
| 265000 | 1 | 0.1% |
| 264000 | 1 | 0.1% |
| 260000 | 1 | 0.1% |
| 250000 | 1 | 0.1% |
| 248000 | 1 | 0.1% |
fuel
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Value | Count |
|---|---|
| Diesel | 534 |
| Petrol | 457 |
| CNG | 5 |
| LPG | 4 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.973 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Diesel |
|---|---|
| 2nd row | Diesel |
| 3rd row | Petrol |
| 4th row | Diesel |
| 5th row | Petrol |
Common Values
| Value | Count | Frequency (%) |
| Diesel | 534 | 53.4% |
| Petrol | 457 | 45.7% |
| CNG | 5 | 0.5% |
| LPG | 4 | 0.4% |
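The mean length of 5.973 reported for fuel follows from the counts in this table: it is the count-weighted average of the label lengths. A small sketch of that check (variable names are illustrative):

```python
# Counts copied from the Common Values table for fuel.
fuel_counts = {"Diesel": 534, "Petrol": 457, "CNG": 5, "LPG": 4}

# Total characters across all rows, then the average per row.
total_chars = sum(len(label) * n for label, n in fuel_counts.items())
total_rows = sum(fuel_counts.values())
mean_length = total_chars / total_rows

print(total_chars, mean_length)
```

The character total of 5973 is also the count shown for fuel in the character tables further down.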
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| diesel | 534 | 53.4% |
| petrol | 457 | 45.7% |
| cng | 5 | 0.5% |
| lpg | 4 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1525 | 25.5% |
| l | 991 | 16.6% |
| D | 534 | 8.9% |
| i | 534 | 8.9% |
| s | 534 | 8.9% |
| P | 461 | 7.7% |
| t | 457 | 7.7% |
| r | 457 | 7.7% |
| o | 457 | 7.7% |
| G | 9 | 0.2% |
| Other values (3) | 14 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5973 |
seller_type
Categorical
Imbalance
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Value | Count |
|---|---|
| Individual | 837 |
| Dealer | 135 |
| Trustmark Dealer | 28 |
Length
| Max length | 16 |
|---|---|
| Median length | 10 |
| Mean length | 9.628 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Individual |
|---|---|
| 2nd row | Individual |
| 3rd row | Individual |
| 4th row | Individual |
| 5th row | Individual |
Common Values
| Value | Count | Frequency (%) |
| Individual | 837 | 83.7% |
| Dealer | 135 | 13.5% |
| Trustmark Dealer | 28 | 2.8% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| individual | 837 | 81.4% |
| dealer | 163 | 15.9% |
| trustmark | 28 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 1674 | 17.4% |
| i | 1674 | 17.4% |
| a | 1028 | 10.7% |
| l | 1000 | 10.4% |
| u | 865 | 9.0% |
| I | 837 | 8.7% |
| v | 837 | 8.7% |
| n | 837 | 8.7% |
| e | 326 | 3.4% |
| r | 219 | 2.3% |
| Other values (7) | 331 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9628 |
transmission
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Value | Count |
|---|---|
| Manual | 877 |
| Automatic | 123 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.369 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manual |
|---|---|
| 2nd row | Manual |
| 3rd row | Automatic |
| 4th row | Manual |
| 5th row | Manual |
Common Values
| Value | Count | Frequency (%) |
| Manual | 877 | 87.7% |
| Automatic | 123 | 12.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| manual | 877 | 87.7% |
| automatic | 123 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1877 | 29.5% |
| u | 1000 | 15.7% |
| M | 877 | 13.8% |
| n | 877 | 13.8% |
| l | 877 | 13.8% |
| t | 246 | 3.9% |
| A | 123 | 1.9% |
| o | 123 | 1.9% |
| m | 123 | 1.9% |
| i | 123 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6369 |
owner
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Value | Count |
|---|---|
| First Owner | 623 |
| Second Owner | 278 |
| Third Owner | 71 |
| Fourth & Above Owner | 27 |
| Test Drive Car | 1 |
Length
| Max length | 20 |
|---|---|
| Median length | 11 |
| Mean length | 11.524 |
| Min length | 11 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | First Owner |
|---|---|
| 2nd row | First Owner |
| 3rd row | First Owner |
| 4th row | First Owner |
| 5th row | Second Owner |
Common Values
| Value | Count | Frequency (%) |
| First Owner | 623 | 62.3% |
| Second Owner | 278 | 27.8% |
| Third Owner | 71 | 7.1% |
| Fourth & Above Owner | 27 | 2.7% |
| Test Drive Car | 1 | 0.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| owner | 999 | 48.6% |
| first | 623 | 30.3% |
| second | 278 | 13.5% |
| third | 71 | 3.5% |
| fourth | 27 | 1.3% |
| & | 27 | 1.3% |
| above | 27 | 1.3% |
| test | 1 | < 0.1% |
| drive | 1 | < 0.1% |
| car | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1722 | 14.9% |
| e | 1306 | 11.3% |
| n | 1277 | 11.1% |
| (space) | 1055 | 9.2% |
| O | 999 | 8.7% |
| w | 999 | 8.7% |
| i | 695 | 6.0% |
| t | 651 | 5.6% |
| F | 650 | 5.6% |
| s | 624 | 5.4% |
| Other values (14) | 1546 | 13.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11524 |
mileage
Text
Missing
| Distinct | 237 |
|---|---|
| Distinct (%) | 24.2% |
| Missing | 19 |
| Missing (%) | 1.9% |
| Memory size | 7.9 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.4057085 |
| Min length | 8 |
Unique
| Unique | 73 |
|---|---|
| Unique (%) | 7.4% |
Sample
| 1st row | 14.0 kmpl |
|---|---|
| 2nd row | 21.5 kmpl |
| 3rd row | 12.9 kmpl |
| 4th row | 25.1 kmpl |
| 5th row | 16.5 kmpl |
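mileage, engine, and max_power are typed as Text because each value carries a unit suffix, so none of the numeric statistics are computed for them. A minimal sketch of splitting such strings into a number and a unit before profiling follows. The function name is hypothetical. The pattern also accepts a slash in the unit in case some rows use a compound unit; that is a precaution, not something the samples above show.

```python
import re

# A number (optionally with decimals), optional whitespace, then a unit token.
_NUMBER_WITH_UNIT = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*([A-Za-z/]+)\s*$")

def split_value_and_unit(text):
    """Return (value, unit) for strings such as '14.0 kmpl'.
    Returns None for missing values or strings that do not match."""
    if text is None:
        return None
    match = _NUMBER_WITH_UNIT.match(text)
    if match is None:
        return None
    return float(match.group(1)), match.group(2).lower()

samples = ["14.0 kmpl", "2498 CC", "108.5 bhp", None]
print([split_value_and_unit(s) for s in samples])
# [(14.0, 'kmpl'), (2498.0, 'cc'), (108.5, 'bhp'), None]
```

Keeping the unit alongside the value makes it possible to check that a column uses a single unit before treating it as numeric.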
| Value | Count | Frequency (%) |
| kmpl | 972 | 49.5% |
| 18.6 | 23 | 1.2% |
| 18.9 | 22 | 1.1% |
| 21.1 | 22 | 1.1% |
| 19.7 | 21 | 1.1% |
| 16.1 | 17 | 0.9% |
| 17.0 | 16 | 0.8% |
| 12.8 | 16 | 0.8% |
| 18.2 | 15 | 0.8% |
| 22.74 | 15 | 0.8% |
| Other values (225) | 823 | 41.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| k | 990 | 10.7% |
| . | 981 | 10.6% |
| (space) | 981 | 10.6% |
| m | 981 | 10.6% |
| p | 972 | 10.5% |
| l | 972 | 10.5% |
| 1 | 783 | 8.5% |
| 2 | 652 | 7.1% |
| 0 | 271 | 2.9% |
| 5 | 251 | 2.7% |
| Other values (8) | 1393 | 15.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9227 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9227 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9227 |
engine
Text
Missing
| Distinct | 88 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 19 |
| Missing (%) | 1.9% |
| Memory size | 7.9 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.8236493 |
| Min length | 6 |
Unique
| Unique | 19 |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | 2498 CC |
|---|---|
| 2nd row | 1497 CC |
| 3rd row | 1799 CC |
| 4th row | 1498 CC |
| 5th row | 1172 CC |
| Value | Count | Frequency (%) |
| cc | 981 | 50.0% |
| 1248 | 116 | 5.9% |
| 1197 | 105 | 5.4% |
| 796 | 63 | 3.2% |
| 998 | 57 | 2.9% |
| 1396 | 51 | 2.6% |
| 2179 | 49 | 2.5% |
| 1498 | 47 | 2.4% |
| 2494 | 32 | 1.6% |
| 1199 | 31 | 1.6% |
| Other values (79) | 430 | 21.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1962 | 29.3% |
| (space) | 981 | 14.7% |
| 1 | 959 | 14.3% |
| 9 | 855 | 12.8% |
| 4 | 386 | 5.8% |
| 8 | 366 | 5.5% |
| 2 | 345 | 5.2% |
| 7 | 290 | 4.3% |
| 6 | 223 | 3.3% |
| 3 | 147 | 2.2% |
| Other values (2) | 180 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6694 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6694 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6694 |
max_power
Text
Missing
| Distinct | 182 |
|---|---|
| Distinct (%) | 18.6% |
| Missing | 19 |
| Missing (%) | 1.9% |
| Memory size | 7.9 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.7787971 |
| Min length | 6 |
Unique
| Unique | 55 |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | 112 bhp |
|---|---|
| 2nd row | 108.5 bhp |
| 3rd row | 130 bhp |
| 4th row | 98.6 bhp |
| 5th row | 65 bhp |
| Value | Count | Frequency (%) |
| bhp | 981 | 50.0% |
| 74 | 43 | 2.2% |
| 88.5 | 28 | 1.4% |
| 47.3 | 24 | 1.2% |
| 81.80 | 24 | 1.2% |
| 67.1 | 22 | 1.1% |
| 46.3 | 21 | 1.1% |
| 88.73 | 20 | 1.0% |
| 88.7 | 20 | 1.0% |
| 70 | 19 | 1.0% |
| Other values (173) | 760 | 38.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 981 | 12.9% |
| b | 981 | 12.9% |
| h | 981 | 12.9% |
| p | 981 | 12.9% |
| . | 617 | 8.1% |
| 8 | 546 | 7.2% |
| 1 | 448 | 5.9% |
| 7 | 422 | 5.5% |
| 6 | 308 | 4.0% |
| 3 | 278 | 3.6% |
| Other values (5) | 1088 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7631 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7631 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7631 |
torque
Text
Missing
| Distinct | 226 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 19 |
| Missing (%) | 1.9% |
| Memory size | 7.9 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 25 |
| Mean length | 16.293578 |
| Min length | 5 |
Unique
| Unique | 89 |
|---|---|
| Unique (%) | 9.1% |
Sample
| 1st row | 260 Nm at 1800-2200 rpm |
|---|---|
| 2nd row | 260Nm@ 1500-2750rpm |
| 3rd row | 172Nm@ 4300rpm |
| 4th row | 200Nm@ 1750rpm |
| 5th row | 96 Nm at 3000 rpm |
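The torque samples mix several notations ("260 Nm at 1800-2200 rpm", "260Nm@ 1500-2750rpm"), and the sample rows later in this document include a value in kgm. A hedged sketch of one way to normalize them into a torque in Nm plus an rpm range follows. The function name and regular expression are illustrative, only the formats visible in this report were considered, and anything else returns None.

```python
import re

KGM_TO_NM = 9.80665  # 1 kilogram-force metre expressed in newton metres

_TORQUE = re.compile(
    r"(?P<value>\d+(?:\.\d+)?)\s*(?P<unit>nm|kgm)"  # torque figure and unit
    r".*?(?P<rpm_low>\d[\d,]*)"                     # first rpm figure
    r"(?:\s*-\s*(?P<rpm_high>\d[\d,]*))?\s*rpm",    # optional upper bound
    re.IGNORECASE,
)

def _to_int(digits):
    # Tolerate thousands separators as a precaution.
    return int(digits.replace(",", ""))

def parse_torque(text):
    """Return (torque_nm, rpm_low, rpm_high), or None if the text does not match."""
    match = _TORQUE.search(text or "")
    if match is None:
        return None
    value = float(match.group("value"))
    if match.group("unit").lower() == "kgm":
        value *= KGM_TO_NM
    low = _to_int(match.group("rpm_low"))
    high = _to_int(match.group("rpm_high")) if match.group("rpm_high") else low
    return round(value, 2), low, high

print(parse_torque("260 Nm at 1800-2200 rpm"))   # (260.0, 1800, 2200)
print(parse_torque("172Nm@ 4300rpm"))            # (172.0, 4300, 4300)
print(parse_torque("22.4 kgm at 1750-2750rpm"))  # (219.67, 1750, 2750)
```

When a single rpm figure is given, the sketch reports it as both bounds, so downstream code can always treat the result as a range.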
| Value | Count | Frequency (%) |
| 4000rpm | 114 | 5.5% |
| 3500rpm | 97 | 4.7% |
| 200nm | 89 | 4.3% |
| 2000rpm | 83 | 4.0% |
| 1750rpm | 69 | 3.3% |
| 190nm | 67 | 3.2% |
| rpm | 63 | 3.0% |
| 90nm | 52 | 2.5% |
| 3000rpm | 39 | 1.9% |
| 2500rpm | 39 | 1.9% |
| Other values (246) | 1373 | 65.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3273 | 20.5% |
| m | 1949 | 12.2% |
| (space) | 1109 | 6.9% |
| 1 | 1054 | 6.6% |
| @ | 990 | 6.2% |
| r | 973 | 6.1% |
| p | 973 | 6.1% |
| N | 895 | 5.6% |
| 2 | 860 | 5.4% |
| 5 | 809 | 5.1% |
| Other values (23) | 3099 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15984 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15984 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15984 |
seats
Real number (ℝ)
Missing
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 19 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4108053 |
| Minimum | 4 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 9 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.91998528 |
|---|---|
| Coefficient of variation (CV) | 0.17002742 |
| Kurtosis | 1.7775742 |
| Mean | 5.4108053 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6424577 |
| Sum | 5308 |
| Variance | 0.84637292 |
| Monotonicity | Not monotonic |
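The mean for seats is computed over the non-missing rows only. A quick check using the figures reported above (Sum, Missing, and the 1000 observations from the overview); variable names are illustrative:

```python
n_observations = 1000   # from the dataset overview
n_missing = 19          # "Missing" for seats
total_seats = 5308      # "Sum" for seats

# Rows that actually contribute to the mean.
n_present = n_observations - n_missing
mean_seats = total_seats / n_present

print(n_present, round(mean_seats, 7))
```

Dividing by all 1000 rows instead would give 5.308, so the reported 5.4108053 confirms that missing values are excluded rather than counted as zero.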
Histogram with fixed size bins (bins=6)
Common Values
| Value | Count | Frequency (%) |
| 5 | 758 | 75.8% |
| 7 | 161 | 16.1% |
| 4 | 24 | 2.4% |
| 8 | 23 | 2.3% |
| 6 | 8 | 0.8% |
| 9 | 7 | 0.7% |
| (Missing) | 19 | 1.9% |
Minimum values
| Value | Count | Frequency (%) |
| 4 | 24 | 2.4% |
| 5 | 758 | 75.8% |
| 6 | 8 | 0.8% |
| 7 | 161 | 16.1% |
| 8 | 23 | 2.3% |
| 9 | 7 | 0.7% |
Maximum values
| Value | Count | Frequency (%) |
| 9 | 7 | 0.7% |
| 8 | 23 | 2.3% |
| 7 | 161 | 16.1% |
| 6 | 8 | 0.8% |
| 5 | 758 | 75.8% |
| 4 | 24 | 2.4% |
Interactions
Correlations
| | fuel | km_driven | owner | seats | seller_type | selling_price | transmission | year |
|---|---|---|---|---|---|---|---|---|
| fuel | 1.000 | 0.174 | 0.000 | 0.220 | 0.106 | 0.150 | 0.000 | 0.133 |
| km_driven | 0.174 | 1.000 | 0.164 | 0.246 | 0.142 | -0.328 | 0.243 | -0.597 |
| owner | 0.000 | 0.164 | 1.000 | 0.063 | 0.174 | 0.165 | 0.147 | 0.281 |
| seats | 0.220 | 0.246 | 0.063 | 1.000 | 0.028 | 0.291 | 0.039 | 0.016 |
| seller_type | 0.106 | 0.142 | 0.174 | 0.028 | 1.000 | 0.364 | 0.362 | 0.196 |
| selling_price | 0.150 | -0.328 | 0.165 | 0.291 | 0.364 | 1.000 | 0.628 | 0.710 |
| transmission | 0.000 | 0.243 | 0.147 | 0.039 | 0.362 | 0.628 | 1.000 | 0.308 |
| year | 0.133 | -0.597 | 0.281 | 0.016 | 0.196 | 0.710 | 0.308 | 1.000 |
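The high-correlation alerts can be recovered from this matrix by scanning the upper triangle for large absolute values. The sketch below assumes a cut-off of 0.5. That threshold is an assumption chosen because it reproduces the alerts listed earlier; the report itself does not state it. The function name is hypothetical.

```python
columns = ["fuel", "km_driven", "owner", "seats",
           "seller_type", "selling_price", "transmission", "year"]

# Rows copied from the correlation table, in the same column order.
matrix = [
    [1.000, 0.174, 0.000, 0.220, 0.106, 0.150, 0.000, 0.133],
    [0.174, 1.000, 0.164, 0.246, 0.142, -0.328, 0.243, -0.597],
    [0.000, 0.164, 1.000, 0.063, 0.174, 0.165, 0.147, 0.281],
    [0.220, 0.246, 0.063, 1.000, 0.028, 0.291, 0.039, 0.016],
    [0.106, 0.142, 0.174, 0.028, 1.000, 0.364, 0.362, 0.196],
    [0.150, -0.328, 0.165, 0.291, 0.364, 1.000, 0.628, 0.710],
    [0.000, 0.243, 0.147, 0.039, 0.362, 0.628, 1.000, 0.308],
    [0.133, -0.597, 0.281, 0.016, 0.196, 0.710, 0.308, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not stated in the report

def high_correlation_pairs(names, rows, threshold):
    """List (a, b, value) for each pair above the diagonal with |value| >= threshold."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(rows[i][j]) >= threshold:
                pairs.append((names[i], names[j], rows[i][j]))
    return pairs

pairs = high_correlation_pairs(columns, matrix, THRESHOLD)
for a, b, value in pairs:
    print(f"{a} <-> {b}: {value:+.3f}")
```

This yields three pairs: km_driven with year (-0.597), selling_price with transmission (0.628), and selling_price with year (0.710), consistent with the four variables flagged in the alerts.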
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
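To make the idea of nullity correlation concrete, the sketch below computes the Pearson correlation between the "is present" indicators of two columns. The toy columns are invented for illustration and are not rows from this dataset; the function name is hypothetical.

```python
def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.
    Returns None when either column is fully present or fully missing,
    because the correlation is undefined without variation."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / (var_a * var_b) ** 0.5

# Toy data (illustrative only): two columns missing on the same rows,
# and one column with no missing values.
toy_mileage = ["14.0 kmpl", None, "12.9 kmpl", None, "16.5 kmpl"]
toy_engine = ["2498 CC", None, "1799 CC", None, "1172 CC"]
toy_year = [2010, 2017, 2007, 2015, 2011]

print(nullity_correlation(toy_mileage, toy_engine))  # close to 1.0
print(nullity_correlation(toy_mileage, toy_year))    # None
```

A value near 1 means two columns tend to be missing on the same rows. Given that mileage, engine, max_power, torque, and seats each have exactly 19 missing values, checking whether they are missing together would be a natural next step.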
Sample
First rows
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | torque | seats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Mahindra Xylo E4 BS IV | 2010 | 229999 | 168000 | Diesel | Individual | Manual | First Owner | 14.0 kmpl | 2498 CC | 112 bhp | 260 Nm at 1800-2200 rpm | 7.0 |
| 1 | Tata Nexon 1.5 Revotorq XE | 2017 | 665000 | 25000 | Diesel | Individual | Manual | First Owner | 21.5 kmpl | 1497 CC | 108.5 bhp | 260Nm@ 1500-2750rpm | 5.0 |
| 2 | Honda Civic 1.8 S AT | 2007 | 175000 | 218463 | Petrol | Individual | Automatic | First Owner | 12.9 kmpl | 1799 CC | 130 bhp | 172Nm@ 4300rpm | 5.0 |
| 3 | Honda City i DTEC VX | 2015 | 635000 | 173000 | Diesel | Individual | Manual | First Owner | 25.1 kmpl | 1498 CC | 98.6 bhp | 200Nm@ 1750rpm | 5.0 |
| 4 | Tata Indica Vista Aura 1.2 Safire BSIV | 2011 | 130000 | 70000 | Petrol | Individual | Manual | Second Owner | 16.5 kmpl | 1172 CC | 65 bhp | 96 Nm at 3000 rpm | 5.0 |
| 5 | Mahindra Thar CRDe | 2019 | 975000 | 12584 | Diesel | Dealer | Manual | First Owner | 16.55 kmpl | 2498 CC | 105 bhp | 247Nm@ 1800-2000rpm | 6.0 |
| 6 | Chevrolet Spark 1.0 LS | 2011 | 150000 | 35000 | Petrol | Individual | Manual | First Owner | 18.0 kmpl | 995 CC | 62 bhp | 90.3Nm@ 4200rpm | 5.0 |
| 7 | Maruti Ritz ZXi | 2012 | 275000 | 70000 | Petrol | Individual | Manual | Second Owner | 18.5 kmpl | 1197 CC | 85.80 bhp | 114Nm@ 4000rpm | 5.0 |
| 8 | Maruti Alto LX | 2011 | 140000 | 72000 | Petrol | Individual | Manual | Second Owner | 19.7 kmpl | 796 CC | 46.3 bhp | 62Nm@ 3000rpm | 5.0 |
| 9 | Hyundai Creta 1.6 CRDi SX | 2016 | 850000 | 58000 | Diesel | Individual | Manual | First Owner | 19.67 kmpl | 1582 CC | 126.2 bhp | 259.9Nm@ 1900-2750rpm | 5.0 |
Last rows
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | torque | seats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | Maruti Alto LXi | 2007 | 95000 | 70000 | Petrol | Individual | Manual | Second Owner | 19.7 kmpl | 796 CC | 46.3 bhp | 62Nm@ 3000rpm | 5.0 |
| 991 | Honda Brio V MT | 2012 | 376000 | 26000 | Petrol | Individual | Manual | First Owner | 19.4 kmpl | 1198 CC | 86.8 bhp | 109Nm@ 4500rpm | 5.0 |
| 992 | Maruti Alto LXi | 2006 | 85000 | 150000 | Petrol | Individual | Manual | Second Owner | 19.7 kmpl | 796 CC | 46.3 bhp | 62Nm@ 3000rpm | 5.0 |
| 993 | Maruti 800 DX | 1999 | 52000 | 100000 | Petrol | Individual | Manual | First Owner | 16.1 kmpl | 796 CC | 37 bhp | 59Nm@ 2500rpm | 4.0 |
| 994 | Maruti Swift Dzire VXi | 2010 | 240000 | 143000 | Petrol | Individual | Manual | First Owner | 17.5 kmpl | 1298 CC | 85.8 bhp | 114Nm@ 4000rpm | 5.0 |
| 995 | Hyundai i10 Magna 1.1L | 2008 | 250000 | 100000 | Petrol | Individual | Manual | Second Owner | 19.81 kmpl | 1086 CC | 68.05 bhp | 99.04Nm@ 4500rpm | 5.0 |
| 996 | Hyundai i20 2015-2017 Sportz 1.2 | 2017 | 440000 | 50000 | Petrol | Individual | Manual | Second Owner | 18.6 kmpl | 1197 CC | 81.83 bhp | 114.7Nm@ 4000rpm | 5.0 |
| 997 | Hyundai i20 Era Diesel | 2009 | 340000 | 40000 | Diesel | Individual | Manual | First Owner | 23.0 kmpl | 1396 CC | 90 bhp | 22.4 kgm at 1750-2750rpm | 5.0 |
| 998 | Hyundai i10 Asta | 2012 | 350000 | 25000 | Petrol | Individual | Manual | First Owner | 20.36 kmpl | 1197 CC | 78.9 bhp | 111.8Nm@ 4000rpm | 5.0 |
| 999 | Honda City i DTec SV | 2016 | 700000 | 110000 | Diesel | Individual | Manual | First Owner | 26.0 kmpl | 1498 CC | 98.6 bhp | 200Nm@ 1750rpm | 5.0 |
Duplicate rows
Most frequently occurring
| | name | year | selling_price | km_driven | fuel | seller_type | transmission | owner | mileage | engine | max_power | torque | seats | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | Honda Jazz VX | 2016 | 550000 | 56494 | Petrol | Trustmark Dealer | Manual | First Owner | 18.2 kmpl | 1199 CC | 88.7 bhp | 110Nm@ 4800rpm | 5.0 | 8 |
| 9 | Jaguar XF 2.0 Diesel Portfolio | 2017 | 3200000 | 45000 | Diesel | Dealer | Automatic | First Owner | 19.33 kmpl | 1999 CC | 177 bhp | 430Nm@ 1750-2500rpm | 5.0 | 6 |
| 28 | Toyota Camry 2.5 Hybrid | 2016 | 2000000 | 68089 | Petrol | Trustmark Dealer | Automatic | First Owner | 19.16 kmpl | 2494 CC | 157.7 bhp | 213Nm@ 4500rpm | 5.0 | 6 |
| 31 | Volvo V40 D3 R-Design | 2018 | 2475000 | 2000 | Diesel | Dealer | Automatic | First Owner | 16.8 kmpl | 1984 CC | 150 bhp | 350Nm@ 1500-2750rpm | 5.0 | 6 |
| 1 | BMW X4 M Sport X xDrive20d | 2019 | 5500000 | 8500 | Diesel | Dealer | Automatic | First Owner | 16.78 kmpl | 1995 CC | 190 bhp | 400Nm@ 1750-2500rpm | 5.0 | 4 |
| 17 | Maruti Swift AMT VVT VXI | 2019 | 650000 | 5621 | Petrol | Trustmark Dealer | Automatic | First Owner | 22.0 kmpl | 1197 CC | 81.80 bhp | 113Nm@ 4200rpm | 5.0 | 4 |
| 23 | Skoda Rapid 1.6 MPI AT Elegance | 2016 | 645000 | 11000 | Petrol | Dealer | Automatic | First Owner | 14.3 kmpl | 1598 CC | 103.5 bhp | 153Nm@ 3800rpm | 5.0 | 4 |
| 25 | Tata Safari Storme EX | 2015 | 503000 | 110000 | Diesel | Individual | Manual | First Owner | 14.1 kmpl | 2179 CC | 147.94 bhp | 320Nm@ 1500-3000rpm | 7.0 | 4 |
| 4 | Hyundai Grand i10 1.2 CRDi Sportz | 2017 | 450000 | 56290 | Diesel | Dealer | Manual | First Owner | 24.0 kmpl | 1186 CC | 73.97 bhp | 190.24nm@ 1750-2250rpm | 5.0 | 3 |
| 10 | Lexus ES 300h | 2019 | 5150000 | 20000 | Petrol | Dealer | Automatic | First Owner | 22.37 kmpl | 2487 CC | 214.56 bhp | 202Nm@ 3600-5200rpm | 5.0 | 3 |
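A table like the one above can be built by counting identical rows. A minimal sketch, assuming rows are compared on all of their fields; the toy rows are shortened to three columns for readability and their counts are illustrative, not taken from the dataset:

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Toy rows (name, year, selling_price), for illustration only.
rows = [
    ("Honda Jazz VX", 2016, 550000),
    ("Maruti Alto LX", 2011, 140000),
    ("Honda Jazz VX", 2016, 550000),
    ("Honda Jazz VX", 2016, 550000),
]

groups = duplicate_groups(rows)
for row, n in sorted(groups.items(), key=lambda item: -item[1]):
    print(n, row)
```

Before modelling, it is worth deciding whether such repeats are genuine separate listings or the same listing ingested more than once, since the answer determines whether they should be dropped.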